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ENGINEERING OF METABOLIC CONTROL 

Background of the Invention 

The use of recombinant DNA technology has allowed the engineering of host cells to 
produce desired compounds, such as polypeptides and secondary metabolites. The large 
5 scale production of polypeptides in engineered cells allows for the production of proteins 
with pharmaceutical uses and enzymes with industrial uses. Secondary metabolites are 
products derived from nature that have long been known for their biological and medicinal 
importance. Because of the structural complexity inherent in such molecules, traditional 
chemical synthesis often requires extensive effort and the use of expensive precursors and 
10 cofactors to prepare the compound. In recent years, the expression of heterologous proteins 
in cells has facilitated the engineering of heterologous biosynthetic pathways in 
microorganisms to produce metabolites from inexpensive starting materials. In this manner, 
a variety of compounds have been produced, including polyketides, (J-lactam antibiotics, 
monoterpenes, steroids, and aromatics. 

1 5 Summary of the Invention 

The invention is based, in part, on the discovery that production of heterologous 
polypeptides and metabolites can be enhanced by the regulated expression of the polypeptide 
(e.g., a biosynthetic enzyme) using a promoter which is regulated by the concentrations of a 
second metabolite, e.g. acetyl phosphate. The term "heterologous" refers to a polypeptide or 
20 metabolite which is introduced by artifice. A heterologous polypeptide or metabolite can be 
identical to endogenous entity that is naturally present. The term "metabolite" refers to a 
organic compound which is the product of one or more biochemical reactions. A metabolite 
may itself be a precursor for other reactions. A secondary metabolite is a metabolite derived 
from another. 

25 Accordingly, in one aspect, the invention features a bacterial host cell containing a 

nucleic acid sequence comprising a promoter and a nucleic acid sequence encoding a 
heterologous polypeptide. Examples of bacterial host cells include Escherichia coli, Bacillus 
subtilis. Salmonella typhimurium, Agrobacterium tumefaciens , Thermus thermophilus t and 
Rhizobium leguminosarum cells. The nucleic acid sequence is operably linked to the 

30 promoter which is controlled by a response regulator protein. In other words, the nucleic acid 
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sequence is linked to the promoter sequence in a manner which allows for expression of the 
nucleotide sequence in vitro and in vivo. "Promoter" refers to any DNA fragment which 
directs transcription of genetic material. The promoter is controlled by a response regulator 
protein, for example, ntrC, phoB, phoP, ompR, cheY, creB, or torR, of E. coli or its homologs 
5 from other bacterial species. Further, the response regulator protein can be another member 
of the cluster orthologous group (COG) COG0745 as defined by 

http://www.ncbi.nlm.nih.gov/COG/ (Tatusov et al. Nucleic Acids Res. (2000); 28:33-36). In 
one implementation, the promoter is bound by E. coli ntrC. The term "ntrC" refers to both 
the E. coli ntrC protein (SWISSPROT : P06713, http://www.expasy.ch/) and its homologs in 

10 other bacteria as appropriate. As used herein, "bound" refers to a physical association with a 
equilibrium binding constant (K D ) of less than 100 nM, preferably less than 1 nM. An 
example of the promoter is the E. coli glnApj promoter, e.g. a region between positions about 
93 and about 323 in the published DNA sequence, GenBank accession no. M10421(Reitzer 
& Magasanik (1985) Proc Nat Acad Sci USA 82:1979-1983). This region includes 

1 5 untranslated sequences from the glnA gene. Further, a translational fusion can be constructed 
between coding sequences for glnA and coding sequences for the heterologous polypeptide. 

The host cell is genetically modified such that the promoter is regulated by acetyl 
phosphate in the absence of nitrogen starvation. For example, the host cell can genetically 
modified by deletion or mutation of a gene encoding a histidine protein kinase, e.g., a 

20 member of COG0642 as defined by (http://www.ncbi.nlm.nih.gov/COG/, Tatusov et al. 
supra.), e.g., glnL, phoR, phoQ, creC, or envZ. In another example, the histidine protein 
kinase has specificity for the response regulator protein which controls the promoter. The 
histidine protein kinase can be encoded by glnL, e.g., E. coli glnL (SWISSPROT P06712; 
http://www.expasy.ch/). 

25 Whereas the host cell is genetically modified such that the promoter is regulated by 

acetyl phosphate in the absence of nitrogen starvation, for heterologous polypeptide or 
metabolite expression, the host cell can be propagated in any desired condition, e.g., in 
nitrogen starvation conditions, nitrogen poor conditions, or nitrogen rich conditions. 
The heterologous polypeptide encoded by the nucleic acid sequence can be a 

30 biosynthetic enzyme required for production of a metabolite. It can be a mammalian protein, 
e.g., a secreted growth factor, a monoclonal antibody, or an extracellular matrix component. 
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In yet another example, the heterologous polypeptide can be a desired antigen for use in a 
vaccine, e.g., a surface protein from a viral, bacterial, fungal, or protist pathogen. 

Another aspect of the invention features a kit containing a nucleic acid sequence 
which includes a promoter controlled by a response regulator protein. The kit further 
optionally contains a bacterial host cell which is genetically modified such that the promoter 
is regulated by acetyl phosphate in the absence of nitrogen starvation. The kit can also 
provide instructions for their use. The nucleic acid sequence can contain a restriction enzyme 
poly linker located 3' of the promoter such that a sequence inserted into the polylinker is 
operably linked to the promoter which is controlled by a response regulator protein. In one 
implementation of the kit, the promoter is the E. coli glnAp 2 promoter and the bacterial host 
cell is an E. coli cell containing a mutation or deletion of the glnL gene. 

Another aspect of the invention features a host cell containing a first expression 
cassette. The first expression cassette includes a promoter, such as any of those described 
above, and a nucleic acid sequence encoding an enzyme required for biosynthesis of a 
heterologous metabolite. As used herein, "enzyme" refers to a polypeptide having ability to 
catalyze a chemical reaction or multiple reactions. The nucleic acid sequence is operably 
linked to the promoter which is regulated by acetyl phosphate in the absence of nitrogen 
starvation. The host cell also contains additional nucleic acid sequences for expressing other 
enzymes required for biosynthesis of the metabolite. Such additional sequences may be 
endogenous sequences expressing endogenous enzymes, or introduced sequences expressing 
heterologous enzymes. 

In one example, the heterologous metabolite is an isoprenoid, a 
polyhydroxyalkanoate, a polyketide, a 0-lactam antibiotic, an aromatic, or a precursor, e.g., 
an upstream metabolite, or a derivative, e.g., a downstream metabolite, thereof. For instance, 

25 the isoprenoid can be a carotenoid, a sterol, a taxol, a diterpene, a gibberellin, and a quinone. 
Specific examples of isoprenoids include isopentyl diphosphate, dimethylallyl diphosphate, 
geranyl diphosphate, famesyl diphosphate, geranylgeranyl diphosphate, and phytoene. 
Specific examples of carotenoids include (3-carotene, C-carotene, astaxanthin, zeaxanthin, 
zeaxanthin-p-glucoside, phytofluene, neurosporene, lutein, and torulene. When the desired 

30 heterologous metabolite is an isoprenoid, the heterologous enzyme can be isopentenyl 
diphosphate isomerase, geranylgeranyl diphosphate synthase, or 1 -deoxyxylulose 5- 
phosphate synthase. When the desired heterologous metabolite is an polyhydroxyalkanoate, 
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the heterologous enzyme can be 3-ketoacyl reductase, or poly-3-hydroxyalkanoate 
polymerase. 

The host cell can be a bacterial cell, e.g., an E. coli cell. The host cell is optionally 
genetically modified by deletion or mutation of a gene, e.g., a gene encoding a histidine 
5 protein kinase, as described above. In one specific example, the host cell further contains a 
second expression cassette containing a nucleic acid sequence encoding phosphoenolpyruvate 
synthase operably linked to a promoter regulated by acetyl phosphate concentration, e.g., 
glnApj. 

Another aspect of invention features a method of producing heterologous isoprenoids 

10 in a host cell. The method includes overexpressing phosphoenolpyruvate synthase and 

expressing biosynthetic enzymes required for synthesis of the heterologous isoprenoid. In 
one implementation, a gene in the host cell encoding a pyruvate kinase or a 
phosphoenolpyruvate carboxylase is genetically deleted or enfeebled. In another 
implementation, a gene encoding phosphoenolpyruvate carboxykinase is overexpressed in the 

1 5 host cell. Still another aspect of the invention features a method of producing a lycopene in a 
host cell. The method includes expressing the following heterologous enzymes: 1-deoxy-D- 
xylulose 5 -phosphate synthase, a geranylgeranyl diphosphate synthase, a phytoene synthase, 
and a phytoene saturase. In one implementation of this method, an isopentenyl diphosphate 
isomerase is overexpressed, e.g., using the glnAp2 promoter. In another implementation, a 

20 phosphoenolpyruvate synthase is overexpressed, e.g., using the glnAp2 promoter. 

Another aspect of the invention features a nucleic acid sequence containing a 
promoter and a sequence encoding a biosynthetic enzyme required for the production of a 
first metabolite. The promoter is operably linked to the sequence, and is regulated by a 
second metabolite whose concentration is indicative of availability of a precursor for the 

25 biosynthesis of the first metabolite. In one example, the second metabolite is a waste product 
produced from a precursor for the biosynthesis of the first metabolite. 

In one implementation, the first metabolite is a polyhydroxyalkanoate, e.g., 
polyhydroxybutyrate and the nucleic acid sequence encodes a biosynthetic enzyme, e.g., a 3- 
ketoacyl coenzyme A (coA) reductases, or a poly-3-hydroxyoctanoyl-CoA polymerase. In 

30 another case, the first metabolite is a polyketide, a 3-lactam antibiotic, or an aromatic. In a 
yet another case, the first metabolite is an isoprenoid, e.g., an isoprenoid mentioned herein. 
The nucleic acid sequence can encode a biosynthetic enzyme required for isoprenoid 
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production, e.g., isopentenyl diphosphate isomerase, geranylgeranyl diphosphate synthase, 1- 
deoxyxylulose 5-phosphate synthase, phosphoenolpyruvate synthase, farnesyl diphosphate 
synthase, geranylgeranyl diphosphate synthase, phytoene synthase, phytoene desaturase, or 
lycopene cyclase. One precursor of isoprenoids can be pyruvate. Pyruvate concentrations 
5 are related to acetate and acetyl-phosphate concentrations. Accordingly, in this instance, the 
second metabolite is acetyl phosphate. The promoter responding to acetyl phosphate can be 
controlled by a response regulator protein, e.g., a response regulator protein mentioned 
above. Such a promoter may only respond to acetyl phosphate in a specific host cell. In a 
particular example, the promoter responding to acetyl phosphate concentration is bound by E. 

10 coli ntrC, e.g., E. coli glnAp: promoter. 

The promoter can be regulated by cAMP. The promoter can be a bacterial promoter 
which binds CAP (catabolite activator protein). In mammals, the promoter can be a promoter 
containing a cAMP response element (CRE), which binds to the proteins CREB, CREM, or 
ATF-1 . In yeast cells, the promoter can be a promoter regulated by cAMP, or a promoter 

15 bound by proteins Gisl, Msn2, or Msn4. Another possible regulatory signal for the promoter 
can be fructose 1 -phosphate, or fructose 6-phosphate. The E. coli FruR protein regulates 
such promoters. 

The nucleic acid sequence can be contained on a plasmid. It can also contain a 
bacterial origin of replication and a selectable marker. The sequence can further contain a 

20 yeast or other eukaryotic origin of replication and appropriate selectable markers, and can be 
integrated into the genome. 

The optimization of biosynthesis of heterologous compounds in host cells is reliant on 
sensing parameters of cell physiology and on utilizing these parameters to regulate the 
biosynthesis. One standard techniques in the art is to grow cells and for the user to 

25 exogenously add an agent, e.g., an inducer, to turn on genes required for biosynthesis of the 
desired product. It has been widely observed that high-level induction of a recombinant 
protein or pathway leads to growth retardation and reduced metabolic activity. (Kurland and 
Dong ( 1 996) Mol Microbiol 21:1-4). The practice of exogenously supplying an inducer is 
empirical and does not monitor the availability of resources in the cell for biosynthesis. In 

30 contrast, natural pathways rely on feedback mechanisms to control such processes. The 
combination of certain promoters with specific genetically defined host cells and 
heterologous polypeptides in this invention unexpectedly results in a highly refined and 
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versatile control circuit that regulates flux to heterologous polypeptide or metabolite 
synthesis in response to the metabolic state of the cell. Indeed, the dynamically controlled 
recombinant pathway provides for enhanced production, minimized growth retardation, and 
reduced toxic by-product formation. The regulation of gene expression in response to 
5 physiological state will also benefit other applications, such as gene therapy. 

The details of one or more embodiments of the invention are set forth in the 
description below. Other features, objects, and advantages of the invention will be apparent 
from the description and from the claims. 

1 0 Detailed Description 

The invention provides methods of engineering metabolic control, e.g., methods of 
utilizing promoters in specific host cells in order to optimize protein expression for either 
protein production or metabolite synthesis. 



15 and nucleic acid sequence encoding a heterologous polypeptide whose expression is desired. 
The expression cassette is constructed using standard methods in the art such that the coding 
nucleic acid sequence is operably linked, e.g., regulated by, the promoter. The promoter is 
chosen such that the promoter is regulated by a parameter of cell physiology or cell metabolic 
state. A variety of promoters can be used. In some applications the expression cassette is 

20 contained within a plasmid, such as bacterial plasmid with a bacterial origin of replication 
and a selectable marker. The expression cassette can be integrated into the genome of cells 
using standard techniques in the art. 

If the expression cassette is to be used for engineering regulated production of a 
heterologous polypeptide during late logarithmic growth or during stationary phase, then the 

25 promoter can be chosen accordingly. For example, a promoter can be chosen that responds to 
small molecule signal, e.g., a second messenger, whose levels accumulate during late 
logarithmic growth or during stationary phase. The second messenger can be a molecule that 
accumulates as a precursor, an intermediate, or a waste product of a biochemical pathway. In 
bacteria, the small molecule signal can be a glycolysis intermediate, e.g., fructose 1- 

30 phosphate or fructose 6-phosphate or a glycolysis waste product, e.g., acetate or acetyl 



A central component of the invention is an expression cassette comprising a promoter 
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phosphate. In eukaryotic cells, cAMP concentrations are a well known signal of nutrient 
state. 

The promoter in the expression cassette can be chosen based on the results of a large 
scale expression analysis experiment, e.g., a gene chip experiment. Genes which are induced 
by acetyl phosphate can be identified by hybridizing to a microarray labeled cDNA prepared 
from cells in grown in acetate and comparing the signal to a reference signal, e.g., to the 
signal of obtained with cDNA prepared from cells in early logarithmic growth. This 
experiment can be performed on both prokaryotic and eukaryotic cells, e.g., bacterial, yeast, 
plant and mammalian cells. For an example of such an experiment in a prokaryote, see 
Talaat et al. (2000) Nat Biotechnol 1 8:679-82 and Oh & Liao (2000) Biotechnol Prog. 
16:278-86. Once a gene is identified which is expressed under the desired condition, its 
promoter can utilized in the expression cassette. Alternatively, the experiment can be 
performed by the exogenous addition of a desired molecule (e.g., a precursor in a metabolic 
pathway) or by manipulation of experimental conditions (e.g., growth to late logarithmic 
phase or growth while a biosynthetic enzyme is overproduced). Promoters can be identified 
based on the genes induced. 

In one instance, an expression cassette is used for engineering regulated production of 
a metabolite in a bacterial cell. The promoter can be selected which is regulated by a second 
metabolite whose concentration is indicative of the availability of a precursor for the 
biosynthesis of the first metabolite. For example, if the first metabolite is an isoprenoid 
which is synthesized from the precursors, pyruvate and glyceraldhyde 3-phosphate, then the 
second metabolite can be acetyl phosphate. In a rich environment, cells produce an excess 
amount of acetyl-CoA, a product of pyruvate. The excess acetyl-CoA is used to produce 
ATP and acetate, which is secreted as a waste product. Acetate concentration increases with 
cell density. Acetate, acetyl-CoA, and acetyl-phosphate concentrations are interrelated by to 
the following biochemical reactions: 

(1) acetyl-CoA + Pi acetyl phosphate + CoA 

(2) acetyl phosphate + ADP acetate + ATP 

Thus, high acetyl phosphate concentration is indicative of excess acetyl-CoA and 
excess pyruvate. A host cell which is genetically modified by deletion or mutation of glnL, 
for example, causes ntrC function to become acetyl phosphate dependent (Feng et al. (1992) 
JBacteriol 174:6061-6070). In this fashion, a promoter regulated by ntrC, e.g., the glnAp2 
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promoter, can be used to control gene expression in response to acetyl phosphate. The 
glnAp2 promoter can be obtained using standard techniques in the art. For example, primers 
can be designed and synthesized that anneal to the glnAp2 promoter. The polymerase chain 
reaction (PCR) can be used to amplify a nucleic acid fragment containing the glnAp2 
5 promoter. This fragment can now be used for further constructions. Likewise, an E. coli 
strain containing deletion of histidine protein kinase gene, e.g., glnL can be easily prepared. 
See Link et al. (1997) J Bacteriol. 1 79(20):6228-6237 for a detailed description of one 
possible method. The sequences encoding a desired heterologous polypeptide can be cloned 
downstream of the glnAp2 promoter so that it is operably linked to the promoter. A host cell 
10 with an inactivated glnL gene can then be transformed with the sequences. The transformed 
strain can be grown, and polypeptide production monitored during the course of growth. 
Robust protein expression can be observed at high cell densities, as in Farmer and Liao 
(2000) Nat. Biotechnol 1 8:533-537, the contents of which are hereby incorporated by 
reference. 

1 5 A mammalian cell can be used as a host cell for polypeptide or metabolite production. 

A promoter can be selected, e.g., a promoter that responds to cAMP. Such a promoter can 
contain a cAMP response element (CRE), which binds to the proteins CREB, CREM, or 
ATF-1 . Using standard techniques in the art, a desired coding sequence can be placed under 
control of the promoter and transformed into the mammalian cell. In some instances, the 

20 construction can be inserted into a virus, e.g., an inactivated virus. Such implementations 
allow for the regulated production of a protein or a metabolite produced by a heterologous 
biosynthetic enzyme in a gene therapy scenario. Plant cells can also be used as host cells. 
Again, an appropriate promoter can be chosen, e.g., a promoter than responds to a plant 
hormone, metabolite, or a precursor for the production of a desired metabolite. A promoter 

25 can be identified by a microarray experiment. After fusion of a desired promoter to a desired 
coding sequence in an appropriate vector, the construction can be electroporated into 
Agrobacterium tumefaciens and then used to transform plant cells using standard methods in 
the art. In still another example, yeast cells can be manipulated to express heterologous 
polypeptides or metabolites under metabolic control. For example, a Saccharomyces 

30 cerevisiae promoter can be a promoter regulated by cAMP, e.g., a promoter bound by 

proteins Gisl, Msn2, or Msn4. The regulation of all yeast genes in response to a variety of 
metabolic conditions is increasingly well studied. For example, DeRisi et ai (1997) Science 
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278:680-686 describe experiments following the transcriptional profile of nearly the entire 
Saccharomyces cerevisiae gene set under various metabolic conditions. Promoters regulated 
by a desired metabolite can be selected based on such data. The generation of yeast plasmids 
and the transformation of yeast are well known in the art. 
5 A variety of metabolic pathways can be reconstructed using the expression techniques 

described above. For example, a pathway to produce lycopene can be introduced in E. coli 
by constructing expression vectors for the following genes: dxs (coding for 1-deoxy-D- 
xylulose 5-phosphate synthase) from E. coli, gps (coding for geranylgeranyl diphosphate 
(GGPP) synthase) from Archaeoglobus fulgidus, and crtBI (coding for phytoene synthase and 
10 desaturase, respectively) from Erwinia uredovora. These genes can reside on a single or 
multiple plasmids, or can be integrated into the E. coli chromosome. In addition, 
phosphoenolpyruvate synthase can be overexpressed using any method, e.g., by fusion to the 
glnAp2 promoter. Isopentyl diphosphate isomerase can be overexpressed using any method, 
e.g., by fusion to the glnAp2 promoter. 

15 In another example, a pathway to produce polyhydroxyalkanoates (PHA), e.g., 

polyhydroxybutyrate can be implemented in E. coli. PHA is a family of linear polyesters of 
hydroxy acids with a variety of thermoplastic properties and commercial uses. Pseudomonas 
aeruginosa genes encoding 3-ketoacyI coenzyme A reductases and poly-3 -hydroxy alkanoate 
polymerase can be placed under regulation of a desired promoter, e.g., glnAp2, since acetyl- 

20 CoA levels can be indicative of precursor availability for PHA synthesis. 

Without further elaboration, it is believed that the above description has adequately 
enabled the present invention. The following examples are, therefore to be construed as 
merely illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever. All publications cited herein are hereby incorporated by reference in their 

25 entirety. 

Methods 

Growth conditions. All E. coli strains were grown in shake flasks containing the designated 
medium at 37°C in waterbath shakers (Model G76; New Brunswick Scientific, Edison, NJ). 
The cultures were grown in minimal media consisting of either M9 defined salts 34 
30 containing 0.5% (wt/vol) glucose or YE defined salts containing 1 .5% (wt/vol) glucose. YE 
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defined salts consisted of (per liter) 14 g K 2 HP0 4 , 16 g KH 2 P0 4 , 5 g (NH 4 ) 2 S0 4 , 1 g MgS0 4 , 
and 1 mg thiamine. Cell turbidity was monitored spectrophotometrically at 550 nm. 
Metabolite measurements. Acetate, pyruvate, and other organic acids were measured using 
HPLC (Constametric 3500 Solvent Deliver}' System and Spectromonitor 3 1 00 Variable 
5 Wavelength Detector; LDC Analytical, Riviera Beach, FL) over an organic acids column 
(Aminex HPX-87H, Bio-Rad Laboratories, Hercules, CA) maintained at 65°C. The mobile 
phase consisted of 0.01 N H2S04, and its flow rate was kept at 0.6 ml min' 1 . Peaks coming 
off the column were detected at 210 nm. Glucose was measured using Sigma kit no. 315-100. 
To quantify lycopene, 1 ml of bacterial culture was extracted with acetone, centrifuged, and 
10 the supernatant absorbance was measured at 474 nm. Lycopene concentrations were 
calculated by comparing absorbances to a standard curve. 

SDS-PAGE and enzyme assays. The protocol for SDS-PAGE is as described by Laemmli 
(1970) Nature 227:680-685. Measurement of P-galactosidase activity was carried out 
essentially as described by Miller (1992) A Short Course in Bacterial Genetics, Cold Spring 
1 5 Harbor Laboratory Press, Cold Spring Harbor NY. 

Results 

Usage of the glnAP2 promoter in E. coli in a heterologous fusion to lacZ. 

Increasing levels of acetyl phosphate can be an indicator of excess glucose flux. The 
current invention features host cells, nucleic acids sequences, and methods of utilizing acetyl 

20 phosphate as a signal to regulate the expression of rate-controlling enzymes in a desired 

metabolic pathway, both to utilize fully the excess carbon flux and to redirect the flux away 
from the toxic product, acetate. 

In order to examine the potential of glnAp2 as a dynamic controller of product 
expression, a nucleic acid sequence was constructed containing a heterologous lacZ gene 

25 operably linked to the glnAp: promoter. The glnAp2 promoter region containing the promoter 
and two ntrC -binding sites can be easily obtained by standard methods known in the art. The 
glnAp2 promoter was PCR-amplified from £. coli genomic DNA using the forward primer 
5'-CAGCTGCAAAGGTCATTGCACCAAC (containing an engineered PvwII site) and the 
reverse primer 5'-GGTACCAGTACGT-GTTCAGCGGACATAC (containing an engineered 

30 Kpnl site). These two primers amplified a region between positions 93 and 343 in the 
published DNA sequence 16 (GenBank accession No. Ml 0421). 
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The glnAp2 PCR fragment was also cloned into the EcoRI site of pRS55 1 , thus 
generating p2GFPuv, which contains glnAp2 in front of a promoterless lacZ gene. The 
glnAp2-lacZ region was transferred to >.RS45 via homologous recombination (Simons et 
a/.(1987) Gene 53:85-96), generating phage Xp2GFPuv. JCL1595 and JCL1596 were 
5 constructed by integrating a glnAp2-lacZ fusion via infection (Silhavy et al. (1984) 

Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor 
NY) with A.p2GFPuv phage into the chromosomes of BW1371 1 (lacX74) and BW18302 
{lacX glnL2001 ; Feng et al. supra), respectively. This strain contains the glnL2001 allele, 
which consists of an internal deletion between codons 23 and 182 of the glnL coding 

10 sequence and presumably results in a null mutation (Feng et al. supra). 

The time course of the P-galactosidase ((3-gal) activity was measured in wild-type and 
in the glnL mutant. The glnAp2-\5-ga\ activity increases in a time-dependent fashion similar to 
the excreted acetate concentration from the glnL host (JCL1596), whereas no induction of 
promoter activity was found for the isogenic wild-type control (JCL1595). 

15 Table 1. P-galactosidase activity of glnAp2-lacZ 





P-galactosidase activity 




(nmol/min-mg protein) 




6 hours 


1 1 hours 


glnA P 2-lacZ in WT (JCL1595) 


<100 


-100 


glnAp2-lacZ in glnL (JCL1 596) 


-700 


-1500 


Piac-lacZ in (VJS632) 


-500 


-550 



Thus, in the absence of glnL, glnAp2 is capable of responding to the excess carbon 
flux that is indicated by acetate excretion. As the cells approached the late-exponential phase, 
the biosynthetic requirement decreased and the cells began to exhibit an excess carbon flux, 
as demonstrated by the increased generation of acetate. At this point, at approximately 6 

20 hours, unexpectedly glnAp2-fi-ga\ activity began to rise to (-700 nmol/min-mg protein, see 
Table 1) whereas glnAp2-$-ga\ activity in the wild-type strain (JCL1595) was relatively low 
and remained constant throughout (-100 nmol/min-mg protein, Table 1). After more than 10 
hours, glnAp2-fi-ga\ activity in the absence of glnL was a remarkable -1500 nmol/min-mg 
protein (Table 1). The induction profile oiglnAp2 is also in stark contrast to that of the lac 

25 promoter (P/„ c )- Chromosomal Pi ac activity in strain VJS632 (/ac + ) rapidly increased after 
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induction with IPTG (isopropyl-P-D-thio-galactopyranoside) and achieved a constant level of 
expression in the cell (-550 nmol/min-mg protein, see Table 1), which is independent of the 
growth phase. 

5 Usage of the glnAP2 promoter in E. coli in a heterologous fusion to pps and aroG 

Expression of two different metabolic enzymes, phosphoenolpyruvate synthase (pps) 
and 3-deoxy-D-arabinoheptulosonate 7 -phosphate (DAHP) synthase (aroG) were placed 
under the control of the glnAp2 promoter. As controls, these same two proteins also were 
overexpressed from the tac promoter (P mc ), which exhibits static control, under the same 

10 genetic background and environmental conditions. Standard methods of expressing pps leads 
to growth retardation (Patnaik et al (1 992) J Bacteriol 174:7527-7532). 

Plasmid pAROG was constructed by cloning a PCR fragment containing aroG 
pRW5tkt into the EcoRl-BamHl sites of pJFl 18EH. Plasmid pPS706 has been previously 
described in Patnaik et al. supra. Both plasmid express the respective genes under the Ptac 

1 5 promoter. The PCR fragment containing the glnAp2 promoter was cloned into the EcoKV- 
EcoRl sites of plasmids pAROG, and pPS706 to generate plasmids p2AROG3, and 
pPSG706, respectively containing the respective genes under the glnAp2 promoter. 

Host strain BW1 8302 (lacX glnL2001) was transformed with all four plasmids. The 
strains with the respective plasmids were grown in M9 salts-glucose media. Growth was 

20 compared after 5 hours. 

Table 2. Growth of Overexpressing Strains 



OD550 after 5 hours growth 


No plasmid 


-0.5 


Ptac-aroG 


-0.5 


glnAp2-aroG 


-0.5 


Piac-pps 


-0.12 


glnAp2-pps 


-0.4 



As previously demonstrated, overexpression of pps using P lac -pps caused marked 
growth retardation. However, the use of glnAp2 unexpectedly resulted in close to normal 
25 growth (Table 2). After 15 hours, proteins were isolated from each strain and analyzed on a 
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1 0% SDS-PAGE gel. At least 500% more pps protein was expressed when the pps gene was 
controlled by the glnA P 2 promoter compared to the P tac promoter. In another surprising 
finding, AroG protein, whose conventional overexpression is not overtly detrimental, was 
also at least 300% more abundant in extracts from cells utilizing glnApJ promoter for 
expression compared to the P, ac promoter. 

Production of Lycopene in E. coli by idi Overexpression 

We reconstructed a recombinant lycopene pathway in E. coli by expressing the genes 
dxs (coding for l-deoxy-o-xylulose 5-phosphate synthase) from E. coli, gps (coding for 
geranylgeranyl diphosphate (GGPP) synthase) from Archaeoglobus fulgidus, and crtBI 
(coding for phytoene synthase and desaturase, respectively) from Erwinia uredovora. These 
genes were inserted into pCL1920, a low-copy-number plasmid, to form pCW9, and 
simultaneously overexpressed. 

We used the glnApl promoter to control the expression of idi (isopentenyl 
diphosphate isomerase). Constructs containing the idi gene were derived from a promoterless 
vector, pJFl 18, The glnA P 2 promoter was inserted to form p2IDI. As a control, the P MC 
promoter was inserted to form pTacIDI. These plasmids were separately introduced into a 
glnL strain (BW18302) containing pCW9. The p2IDI-containing strain {glnAp2-idi) 
produced 100 mg L" 1 lycopene after 26 h in a defined medium containing glucose. The strain 
containing ? lac -idi on the other hand, produced only a small amount of lycopene, (< 5 mg L" 1 
) under identical conditions. Additionally, the p2IDI strain produced almost threefold less 
acetate than pTacIDI, which indicates that the carbon flux to acetate was being rechanneled 
to lycopene. 

Table 3. Carbon yield of lycopene formation in batch cultures of E. coli. 



Host only (BW18302) 
+ pTacIDI (Ptac-idi) 



Lycopene Carbon yield on glucose 
(mol C/mol C) 



0.0000 



0.0003 



+ pTacIDI (Ptac-W/) / pPS184 (Ptac-pps) 

+ p2IDI (glllApy-idi) 

+ p2IDI (glnApridi) I pPSGl 84 (glnAp : -pps) 



0.0012 



0.022 



0.014 
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Use of pps to Enhance Lycopene Yield 

pps was overexpressed from glnApJ from another compatible plasmid, pPSGl 8 while the 
remainder of the lycopene pathway (dxs, gps, crtBI) was expressed using pCL1920. 
Coexpression of pps and idi with the lycopene pathway increased the final titer of lycopene 
by 50% and caused the productivity to increase by threefold, from 0.05 mg ml/ 1 h" 1 to 0.16 
mg mL* 1 h" 1 (Table 3) This is in contrast to the companion strain containing both pTacIDI and 
pPS 1 84 (P tac -idi + Piac-pps), where no significant improvement in yield was observed and 
substantial growth inhibition occurred. 



Additional Host Cells for Lycopene Production 

The pykF::cat and pykA::kan alleles were introduced into a wild-type strain, in order to 
generate two single mutants (JCL1610 (pykF) and JCL1612 (pyL4)) and one double mutant 
strain (JCL1613 (pykF pykA)) (Ponce et al (1995) J Bacterioll 77:571 9-5722). The double 
mutant strain was able to achieve a final lycopene titer of about 14 mg lycopene/g dried cells, 
while the single mutant strains each obtained lycopene titers of about 2.5 mg lycopene/g 
dried cells. The single pyk mutants produced lycopene at a level similar to the wild type 
strain,- 4 mg lycopene/g dried cells. Further, overexpression of Pck, phosphoenolpyruvate 
carboxykinase , increased the final lycopene titer by about 3-fold. Overexpression of Ppc, 
phosphoenolpyruvate carboxylase, reduced lycopene production by about 30%. 



A number of embodiments of the invention have been described. Nevertheless, it will 
be understood that various modifications may be made without departing from the spirit and 
the scope of the present invention. Accordingly, other embodiments are within the scope of 
the following claims. For example, all homologs of the mentioned polypeptides and genes 
are within the scope of this invention. 
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